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BACKGROUND OF THE INVENTION 



The present application claims priority to U.S. Provisional Patent Application 
Serial No. 60/182,060, filed February 11, 2000, the entire text of which is specifically 
5 incorporated by reference herein without disclaimer. 

A. Field of the Invention 

The present invention relates generally to the fields of protein chemistry, 
biochemistry, organic chemistry and molecular biology. More particularly, it concerns a 
10 novel screen for identifying peptides having target binding affinity. 

B. Description of Related Art 

With the end of the human genome project in sight, the next great challenge in 
human biology will be to deduce the function of the thousands of new gene products 

15 identified by the sequencing effort. Perhaps the most direct way to take advantage of 
knowing the sequences of these proteins would be to design compounds capable of 
binding specific epitopes in a factor of interest. These binding agents could then be used 
for a variety of purposes, including affinity purification of the protein or the manipulation 
of its post-translational modification (see below). 

20 Unfortunately, peptides, or peptide epitopes in proteins, are difficult targets for 

molecular recognition studies in aqueous solution. An unstructured peptide, or even one 
with a well-defined secondary structure, does not present to a prospective ligand the kind 
of molecular crevices and canyons present in a typical globular protein or in a double 
helical nucleic acid, which shield interacting groups from competition by solvent water. 

25 Indeed, looking at the prior art, it was not clear that it would be possible to form stable, 
highly specific, non-covalent complexes between peptide targets and small molecules in 
water. Leucine zipper domains are the smallest known naturally occurring motif that 
supports sequence- specific interactions, and these moieties are comprised of 30 or more 
amino acids (Landschulz et al., 1990; O'Shea et al., 1992) there are no natural examples 

30 of short peptide sequences that are independently capable of mediating protein-protein 
interactions, 
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There has been considerable interest in the organic chemistry community in 
developing synthetic peptide receptors. While some elegant chemistry has been 
performed, the receptors that have evolved from these efforts are far from being 
practically useful to biologists. Still and co-workers searched synthetic combinatorial 
5 libraries of cyclic amides for molecules able to bind tripeptides and incorporated these 
into fluorescence-based sensor systems (Burger and Still, 1997; Chen et al., 1998; Cheng 
et al., 1996; Shao and Still, 1996; Still, 1996). Complexes with K D s of 20-40 \M were 
obtained, but these receptors and the target peptides only associated in chloroform. The 
literature on receptor-peptide binding in water is even less developed. For example, 

10 Hossain & Schneider (Hossain and Schneider, 1998) describe tripartite receptors for di- 
and tripeptides comprised of a crown ether and a quaternary amine separated by a 
hydrophobic spacer. These bind hydrophobic di- and tripeptides in water with K D s in the 
mM to (iM range, and very modest sequence specificity. These receptors were designed 
specifically to utilize the charged N- and C-terminal ends of a small peptide in binding, 

15 and so would not be applicable to the recognition of peptide epitopes in proteins. 

The only report in the literature of peptide-like epitope-binding molecules that 
function in water is that of Nestler and co-workers at Cold Spring Harbor (Dong et al., 
1999). These workers employed synthetic, bead-bound libraries of "forcep" molecules as 
potential epitope receptors. These forceps were comprised of multiple copies of a single 

20 peptide or peptide-like molecule displayed on either a stiff or floppy molecular scaffold. 
Forceps that bind an epitope from Ras were isolated and shown to be able to inhibit the 
farnesylation of Ras when present at high concentrations. However, the Nestler study was 
restricted to these homo-oligomeric forceps, which bind their target weakly. It was not 
demonstrated that this is a system of general utility, nor was it demonstrated that simple 

25 epitope-binding peptides lacking an elaborate superstructure could be isolated. Finally, 
the method employed by Nestler and co-workers requires a synthetic target molecule. 

At present, the only class of molecules generally useful for peptide recognition in 
water is antibodies. Antibodies are, of course, proteins; not low molecular weight 
compounds. They are relatively fragile compared to small molecules. Using classical 

30 methods, they are tedious and expensive to obtain, particularly in large quantities, 
although advances in the construction of single chain antibody libraries on phage 
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(Griffiths and Duncan, 1998; Rader and Barbas, 1997) promise to speed up this process. 
Finally, antibodies are not easily rendered cell-permeable. 

Another major issue with antibodies is one of scale. For example, immunoaffmity 
purification of multi-protein complexes containing an epitope-tagged protein is an 
5 important tool in probing the biochemistry of splicing, transcription, DNA replication and 
many other critical cellular processes. However, they are extremely expensive to carry 
out on a large scale since the commercially available monoclonal antibodies used for 
these studies sell for more than $100 per milligram. Furthermore, the antibody-epitope 
interaction is generally so tight, that yields of only 1-10% are realized when one attempts 

10 to elute the tagged complex from the column with synthetic epitope peptide. This also 
makes reuse of the resin very difficult. This technology is good enough to provide 
quantities sufficient for identification of the component polypeptides by microsequencing 
techniques, but it is difficult to prepare enough material for extensive biochemical 
analysis, let alone structural studies. Thus, a major goal in the proteomics area is to 

15 develop synthetic epitope receptors that function in water and bind their targets 
selectively with modest to very high affinities. 

In order to achieve all the goals necessary for such projections, the following 
attributes are required of the desired affinity reagent. First, the epitope-binding molecule 
(EBM) must bind to a given epitope (sequence of about 5-15 amino acids) in a protein of 

20 interest with very high specificity. Furthermore, it is critical that the inventor is able to 
choose the epitope. Screens carried out using intact recombinant protein targets almost 
invariably identify molecules that recognize natural interaction sites, probably because 
these represent the most "bindable" surface of a protein (Fairbrother et al, 1998; Zhu et 
al., 2000). However, the inventor wishes to isolate native complexes where these 

25 interaction surfaces will generally be occupied, so the affinity reagent must recognize 
some other surface of the protein target. 

Second, EBMs of both moderate and high affinity must be available. High 
affinity EBMs will be of utility in the construction of biosensors (Kodadek, 2001) and in 
other applications as antibody replacements. However, for the affinity purification of a 

30 protein and its associated factors, EBMs of modest affinity will be much more useful, 
since the target protein can be eluted much more efficiently from a modest affinity EBM 
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under native conditions than from a tight-binding antibody. Based on data that will be 
discussed below, the inventors believe that for affinity chromatography applications, a Kd 
between 10" 4 M and 10" 7 M would be ideal (Zhang et al., 2000). For biosensor 
applications, EBM-protein complexes with KdS of 10" 9 M or below are desired. 
5 Third, the EBM must be a relatively small molecule that can be synthesized in at 

least milligram, and preferably gram, quantities. Fourth, the screen used to identify the 
EBM must be relatively rapid and convenient, so as to be capable of supporting high- 
throughput. Thus, it is clear that there remains a considerable need in the field for a 
reagent with all of foregoing properties. 

10 

SUMMARY OF THE INVENTION 

Thus, in accordance with the present invention, there is provided a method for 
identifying a peptide-peptide interaction comprising (a) providing a first fusion construct 

15 comprising target peptide fused to a first DNA binding domain; (b) providing a second 
fusion construct comprising a library encoded peptide (LEP) fused to second DNA 
binding domain (DBD), wherein said second DBD works as a complex with said first 
DBD to facilitate binding of said complex to a prokaryotic operator region; (c) contacting 
said first and second fusion constructs in a prokaryotic host cell which comprises said 

20 prokaryotic operator region, wherein said prokaryotic operator region is operationally 
linked to a coding region for one or more indicator polypeptides; and (d) determining 
binding of said complex to said operator region, whereby binding of said complex to said 
operator region identifies said LEP as a binding partner for said target peptide. 

The binding of the complex to the operator, in one embodiment, acts blocks the 

25 transcription of said coding region, for example, to render said prokaryotic host cell 
insensitive to phage infection. In this embodiment, step (d) may comprise infection with 
a phage that infects, replicates and lyses said prokaryotic host cell. For example, the 
operator may be the lacZ operator, and the first and second DBDs are derived from the X 
repressor. 

30 The one or more indicator polypeptides may produce a colorimetric or fluorescent 

product, or be an enzyme like P-gal. The target peptide may be 5 to about 5000 residues 
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in length, or 10 to about 2000 residues in length. The LEP may be 5 to about 50 residues. 
The first and second fusion constructs may be encoded by a nucleic acid segment under 
the control of a promoter operable in said prokaryotic host cell. 

The target peptide and LEP may bind with an affinity in the range of about 10" to 
5 about 10' 6 M, in particular with an affinity in the range of about 10" 4 M, about 10" 5 M, or 
about 10" 6 M. 

The method may further comprise random mutagenesis of said LEP, followed by 
measuring the change, if any, in the binding affinity of said LEP for said target. The 
method may also comprise measuring binding of said LEP to the target peptide under a 

1 0 second set of more stringent conditions. 

The method may further comprise (e) linking the identified LEP to a third peptide, 
whereby linking permits said identified LEP and the third peptide to interact 
independently with the target peptide; (f) then contacting the target peptide with the 
identified LEP-third peptide complex, and (g) followed by determining the change, if 

15 any, in the binding affinity of said LEP for the target peptide. Again, the measuring may 
also comprise measuring binding of said LEP to the target peptide under a second set of 
more stringent conditions. 

The third peptide may be known to bind said target peptide, may be a member of 
a peptide or peptidomimetic library, may be an enzyme substrate, an antigen, or a 

20 eukaryotic cell antigen. For example, the enzyme substrate may be a bacterial, viral or 
fungal antigen. The eukaryotic cell antigen may be a tumor cell marker, an HLA antigen, 
a cell surface receptor, or a cell surface transporter. 

In another embodiment, the method comprises, prior to the determining, the step 
of stabilizing the interaction between the target peptide and said LEP, for example, by 

25 cross-linking or phototrapping. The first peptide also may comprise a multimer of a 
smaller peptide unit. 

The method may further comprise assessing binding of the target peptide to said 
identified LEP by Western blot, mass spectroscopy, or nuclear magnetic resonance. 

In another embodiment, there is provided a method for screening a peptide library 
30 for peptide-peptide interactions comprising (a) providing a plurality of a first fusion 
construct comprising a target peptide fused to a first DNA binding domain; (b) providing 
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a plurality of second fusion construct comprising a library of encoded peptide (LEPs) 
fused to second DNA binding domain (DBD), wherein the second DBD works as a 
complex with the first DBD to facilitate binding of the complex to a prokaryotic operator 
region; (c) transferring said pluralities of first and second fusion constructs into a 
5 prokaryotic host cell which comprises said prokaryotic operator region, wherein said 
prokaryotic operator region is operationally linked to a coding region for one or more 
indicator polypeptides; and (d) determining binding of complexes to the operator region, 
whereby binding of the complexes to the operator region identifies associated LEPs as 
binding partners for the target peptide. Steps (a)-(d) may be repeated at least once using 

10 the LEP identified in step (d). The LEPs, in one embodiment, may be synthesized from a 
four base cutter-digested DNA library. The method may also further comprise the step of 
sequencing a DNA encoding an identified LEP. 

In still another embodiment, there is provided a library encoded peptide (LEP) 
selected according to a method comprising (a) providing a first fusion construct 

15 comprising target peptide fused to a first DNA binding domain; (b) providing a second 
fusion construct comprising the LEP fused to second DNA binding domain (DBD), 
wherein the second DBD works as a complex with the first DBD to facilitate binding of 
the complex to a prokaryotic operator region; (c) contacting the first and second fusion 
constructs in a prokaryotic host cell comprising said prokaryotic operator region, wherein 

20 the prokaryotic operator regions is operationally linked to a coding region for one or 
more indicator polypeptides; and (d) determining binding of the complex to the operator 
region, whereby binding of the complex to the operator region identifies the LEP as a 
binding partner for the target peptide. 

In yet a further embodiment, there is provided a heterodimeric binding molecule 

25 comprising (a) a first peptide that binds to a target molecule; (b) a second peptide that 
binds to said target molecule, wherein at least one of said first and second peptides is a 
member of a peptide library; and (c) a linker molecule connecting said first and second 
peptides such that the linking permits said first and second peptides to interact 
independently with said target molecule. The binding molecule may further comprise a 

30 moiety that permits recovery of said molecule, for example, a magnetic bead. The 
heterodimeric binding molecule may be expressed on the surface of a phage. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



The following drawings form part of the present specification and are included to 
5 further demonstrate certain aspects of the present invention. The invention may be better 
understood by reference to one or more of these drawings in combination with the 
detailed description of specific embodiments presented herein. 

FIGS. 1 A and IB - Schematic representation of the screening protocol. The 

10 assay is carried out as described in Zhang et al. (Zhang et aL, 2000). Two plasmids, 
depicted graphically in FIG. 1A. ? are transformed into E. coli. one encodes the lambda 
repressor DNA-binding domain (DBD) fused to a target epitope. The other encodes the 
DBD fused to a peptide library. As shown in FIG. IB., cells that contain library-encoded 
peptides (LEPs) that bind the target epitope, reconstitute the otherwise weak DNA- 

15 binding activity of the lambda Repressor DBD. This renders these cells immune to phage 
lambda infection, allowing them to be selected form the population. 

FIG. 2 - Affinity purification of a recombinant protein based on peptide- 
peptide complexes isolated from the genetic assay. Scheme for purification of GST- 
13-mer by affinity chromatography using immobilized MBP-LEP27 (Zhang et al., 2000). 

20 FIG. 3 - Scheme for purification of a recombinant protein based on a 

peptide-peptide interaction. The yeast Gal80 protein was fused to two tags, six 
histidine and LEPB (a 15-mer selected to bind a 14-mer called ICS using the system 
shown in FIGS. 1 A-B). This protein was expressed in E. coli. A crude extract was split 
and applied either to a standard Ni-NTA agarose resin (Qiagen) or to synthetic ICS 

25 peptide immobilized covalently on Affi-gel. The ICS peptide column provided a high 
level of purity of the LEPB-tagged protein. 

FIG. 4 - Schematic representation of the protocol employed to eliminate 
most false positive in the Repressor reconstitution assay using biological methods. 

FIG. 5 - "Far Western" blot protocol to eliminate false positives. An extract is 

30 prepared from phage-resistant cells that express the DBD-LEP fusion. This is 
electrophoresed through a denaturing gel, then blotted onto a membrane. This is probed 
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with a GST fusion protein containing the target epitope, followed by labeled anti-GST 
antibody to visualize the site(s) of binding of the GST-peptide fusion protein. In this 
case, two of the four selected peptides provided strong signals, showing that they are true 
hits, while two (LEPC and LEPD, provided weak or unobservable signals, indicating that 
5 they are false positives, or binding peptides that associate too weakly to be of practical 
utility. LEPA = Large (40 kD) polypeptide; LEPB = KARKEAELAAATAEQ; LEPC = 
PCP; LEPD = PCHLNCSLQTLSPTRTTPRKHCKHCFKTLSEKMKWN. 

FIG. 6 - Proposed mechanism of the photo-initiated protein cross-linking 
reaction using a ruthenium initiator. 

10 FIG. 7- Diagram of a "semi-synthetic phage display scheme in which a lead 

peptide is attached to a phage-displayed library. The resultant "pincer library, each of 
which contains the same lead peptide, can be screened for high affinity binders to a 
particular target protein. 

FIG. 8 - Schematic diagram of a substrate-targeted protease inhibitor. This 

15 would be a novel application of EBMs capable of binding an epitope containing a 
protease cleavage site. Unlike common protease inhibitors, they could block processing 
of one, but not all, substrates of a particular enzyme. The same approach could 
theoretically be applied to the inhibition of phosphorylation events, ubiquitination and 
many other protein post-translational modifications. 

20 FIG. 9 - Experimental demonstration of substrate-targeted inhibition of 

proteolysis using an epitope-binding peptide (LEPB). The experiment shown 
schematically in FIG. 5 was carried out. Addition of increasing amounts of LEPB to a 
reaction containing a protease and two different substrates inhibited one cleavage event, 
but not the other. Only the substrate containing the epitope recognized by LEPB was 

25 protected from cleavage. GG = GSTneayvhdgpvrslnGFP; GM = 
GSTgvvwfkdsvgvsgnMBP; LEPB = karkeaelaaataeq. 



30 
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DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 



As discussed, an important goal in chemical biology is to be able to obtain 
specific ligands for any biomolecule of interest. Impressive advances have been made in 
5 isolating molecules, many of which are antibodies or antibody-derived, that bind proteins 
and nucleic acid targets with well-defined macromolecular structures. However, the 
identification of sequence-specific, peptide-binding ligands has been more difficult. In 
addition, even antibodies have major drawbacks: they are tedious and expensive to 
generate, difficult to produce in large quantities, are relatively fragile molecules 

10 unsuitable for certain field applications, and often bind so tightly that they or their target 
proteins are damaged upon attempted extraction. Natural peptide-binding proteins or 
protein domains have been mutagenized to derive species with novel binding specificities 
(Schneider et al 9 1999), but like antibodies, these are globular macromolecules. Thus, 
though the development of synthetic receptor molecules has seen important advances in 

15 the last few years, the field remains in its infancy. Therefore, it remains an important 
goal to develop non-macromolecular species that retain the favorable molecular 
recognition characteristics of antibodies, but can be identified quickly and easily and 
synthesized in large amounts. 

20 A. The Present Invention 

It is envisioned that if it is possible to isolate small peptides that recognize 
specific epitopes within a target protein, then these reagents could be employed as 
capture ligands for that protein. This would obviate the need to create epitope-tagged 
versions of the native proteins for the purpose of immunoaffinity chromatography, a 

25 tedious and imperfect process that is not well-suited for high-throughput proteomic 
applications. An additional attraction of this approach to protein isolation would be that 
many proteins would co-purify with associated partners that could be identified by mass 
spectrometry techniques. This would provide an appealing alternative to the two-hybrid 
assay as a method to identify the binding partners of a given protein of interest. Finally, 

30 as will be described in detail, epitope-binding compounds have exciting potential as 
reagents for controlling the post-translational modification of proteins in a novel fashion. 
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For example, an epitope-binding compound capable of recognizing a site of proteolysis 
or phosphorylation on a target protein could protect that factor from chemical 
modification. This would be an extremely valuable tool for molecular biology 
researchers and could also serve as the paradigm of a new family of therapeutic 
5 compounds. 

The overall goal of this project was to develop a general method for the discovery 
of relatively low molecular weight EBMs that can be chemically synthesized (i.e., they 
are not macromolecules such as antibodies, other types of proteins, or nucleic acids). 
Thus, the inventor initiated an effort to isolate heteromeric complexes comprised of small 

10 peptides, even smaller than leucine zippers, that could be employed as EBMs. As stated 
above, a priori, it was not clear how feasible this endeavor would be. Stable, specific 
complexes between naturally-occurring peptides of less than 25-30 residues are 
essentially unknown, although complexes between small peptides and large proteins are 
very common. This is probably because macromolecular proteins have cavities into 

15 which a peptide can insert, thereby shielding many of the interactions from competition 
by solvent water. This kind of shielding is not possible for complexes between small 
peptides. Additionally, most peptides do not adopt stable secondary or tertiary structures, 
leading to the expectation that the entropic cost of forming a complex between small 
peptides would be much higher than binding of a peptide to a structurally well-defined 

20 protein. These very reasonable biases have presumably deterred efforts to identify such 
complexes and to use them in biotechnology applications. This bias was also supported 
by early efforts of the inventor and his co-workers to accomplish this goal through the 
use of well-established methods, including the yeast two-hybrid system (Fields and Song, 
1989; Yang et al., 1995) and phage display (Burton, 1995). These approaches failed 

25 completely. 

However, the inventor has now demonstrated that it is indeed feasible to isolate 
highly specific complexes between relatively small peptides. A simple genetic selection 
scheme (see FIGS. 1A-1B) is described that allows one to search a peptide library in 
living cells for molecules that bind a given target peptide with high specificity and 
30 reasonable affinity. In one case, a 13-residue sequence from the protease cleavage site of 
the human insulin-like growth factor-I (IGF-I) was employed as the target. A crude 
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extract made from E. coli cells expressing either GST or the GST-target peptide fusion 
protein was applied to amylose columns saturated with either MBP, MBP-LEP27 or 
MBP-60a.a. (a fusion containing a 60 residue peptide unrelated to LEP27). After 
thorough washing, the bound proteins were eluted with maltose and analyzed by SDS- 
5 PAGE and staining with Coomassie Brilliant Blue. The GST-target peptide fusion was 
retained only by the MBP-LEP27 protein. No other proteins were observed. MBP- 
LEP27 did not retain GST alone from an extract. The 27-residue peptide which binds the 
13-mer target with a K D of approximately 2 x 10" 5 M and very high specificity (as 
determined by affinity chromatography, see FIG. 2) was identified from a library of 

10 approximately 50,000 molecules. Using a larger library, a 15mer (LEPB) that binds a 
14mer epitope that is part of human pro-IL-lp using the same system (Zhang et al., 
2000). This selection scheme is applicable to the identification of peptide ligands for 
almost any given target peptide. 

The lambda Repressor reconstitution system has many advantages over other 

15 possible systems for selecting peptide-binding peptides. It is sensitive to even low 
affinity interactions, as is believed will be typical of peptide-peptide contacts. It is 
carried out in E. coli, rather than slower-growing and less easily transformable yeast or 
higher cells. Finally, the fact that the assay is carried out in the presence of a large excess 
of other proteins (the cellular constituents of E, coli), it is anticipated that the binding will 

20 be highly specific. This expectation has been borne out by affinity chromatography 
experiments (FIGS. 2 and 3). 

An alternative transcriptional readout system for use according to the present 
invention is described by Dove et al (1998), incorporated by reference herein. 

25 B. Binding Partner Read-Out Systems 

1 . Lambda Repressor System 

To search a peptide library for molecules that act as selective receptors for a given 
target peptide, an adaptation of a selection scheme developed by Hu et al. for the genetic 
analysis of leucine zipper interactions (Hu et al., 1990; Hu, 1995) was employed. 
30 Binding of X Repressor dimers to cognate operator sites is reduced dramatically when the 
C-terminal dimerization domain is deleted. However, high affinity repressor-operator 
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binding can be reconstituted by fusion of the DNA-binding domain (DBD) to a number 
of heterologous proteins or protein fragments that self-associate and functionally replace 
the native dimerization domain (Cairns et al, 1997; Jappelli and Brenner, 1996; Park and 
Raines, 2000). To adapt this assay for our purposes, two compatible constructs were 
5 made and transformed into E. coli. One encoded the DBD fused to a target peptide (an 
epitope of 8-15 amino acids). The other expressed a fusion of the DBD with a library of 
DNA fragments fused to the DBD. Association of the library-encoded peptide (LEP) 
with the target peptide should reconstitute high-affinity repressor-operator binding. E. 
coli cells carrying a functional repressor are resistant to phage lambda infection, since the 
10 X Repressor blocks the high-level transcription of genes essential for lytic growth 
(Ptashne, 1987). This protocol results in the isolation of many phage-resistant colonies, 
which include cells containing the desired peptide-binding peptides as well as false 
positives. 

15 2. Elimination of False-Positives 

"False positives," i.e., phage-resistant cells that nonetheless do not harbor an 
epitope-binding LEP, can result from several different events. The most common is direct 
or indirect LEP homooligomerization. Another is a promoter mutation that results in 
overexpression of the DBD-LEP fusion protein (the DBD alone retains a very weak 

20 dimerization capacity and will repress if expressed at very high levels). The protocol in 
FIG. 4 is designed to eliminate as many false positives as possible at the 
microbiology/molecular biology level, before proceeding to more labor-intensive 
biochemical analysis of the isolated peptide-peptide complexes. A two step process will 
be employed to eliminate the very common homooligomeric LEPs. 

25 First, colony hybridization will be employed using DNA probes that correspond 

to the sequences encoding homooligomeric peptides. These probes will be obtained as 
follows. The entire library will be transformed into E. coli and challenged with phage in 
the absence of a target plasmid, thus selecting for homooligomeric peptides (the inventor 
has demonstrated the feasibility of this approach previously (Zhang et al., 1999)). All of 

30 the surviving colonies will be scraped off the plate and a preparation of total plasmid 
DNA will be carried out. This mixture will then be used as templates for a PCR™ 
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amplification of the peptide-encoding regions. The primers will be complementary to 
conserved regions of the plasmid flanking the library-encoding regions. When a real 
selection for epitope-binding peptides is carried out, the survivors will be replica plated, 
and the new plate will be probed with these PCR™ products (radiolabeled with 32 P). 
5 Very high stringency conditions will be employed such that annealing of the conserved 
region present in the PCR™ primers and the library-encoding plasmids will not lead to a 
positive signal. Only those cells that do not "light up" with the homooligomer probes 
will be processed further. 

As a second step to ensure that repression is dependent on the presence of the 

10 DBD-target peptide expression plasmid, the inventor will cure the cells of the target- 
encoding plasmid and verify that efficient repression no longer occurs. Initially, the 
inventor did this by gel separation of the linearized library- and target-encoding plasmids, 
followed by religation of the library-encoding plasmid and transformation into fresh cells, 
but this is tedious. The inventor has placed a unique Fsel site (recognition sequence 5'- 

15 GGCCGGCC-3') in the target plasmid, allowing it to selectively linearize the target 
plasmid in a mixed preparation that also contains the library-encoding plasmid. The 
DNA preparation will then be treated with T4 DNA polymerase in the absence of dNTPs 
to allow the voracious exonuclease activity of this enzyme to degrade the linearized 
target-encoding DNA. The resultant DNA will be retransformed into fresh cells and the 

20 transformants will be streaked onto X-gal plates (the receptor strain employed in this 
assay, JH372, has an integrated lacZ gene under the control of a lambda operator (Hu et 
aL 9 1990)). The vast majority of cells will now contain only the library-encoding 
plasmid, so any survivors that make white patches at this point will be discarded. Of 
course, this protocol will result in the loss of a true positive if the library DNA contains a 

25 Fsel site. But this will occur very infrequently since it is an eight base pair site and is 
comprised entirely of C's and G's. 

This two-step hybridization/target plasmid curing protocol should eliminate false 
positives due to the presence of homooligomeric library-encoded peptides completely and 
has the added advantage of also identifying false positives that result from overexpression 

30 of the DBD-LEP fusion protein. 
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To eliminate false positives due to promoter mutations in the target-encoding 
plasmid that result in overexpression, the DBD-LEP-encoding plasmids from the 
remaining apparent positives from the FIG. 4 protocol will be retransformed into fresh 
cells harboring the wild-type target plasmid. All of the transformants should be white 
5 when plated onto X-gal plates. If the cells are blue, they will be discarded. Finally, it is 
important to note that screens that require retransformation into fresh cells that have 
never seen virus will also eliminate false positives due to completely plasmid- 
independent events such as surface receptor mutations that block phage DNA uptake. 
With this optimized protocol, the inventor anticipates that it will take about a week to 
10 identify LEPs that bind any given target epitope and, of course, several different 
selections can be carried out simultaneously. 

3 . Biochemical verification of LEP-epitope binding. 

The putative epitope-binding LEPs that survive the tests described above are then 

15 tested biochemically for epitope binding. In addition to verifying a biochemical 
interaction, it is important to distinguish very weak interactions from stronger ones that 
will prove of practical utility. This is done by expressing a GST fusion protein 
containing the target peptide at the C-terminus. This protein is then employed as a probe 
in a Far- Western blot experiment (FIG. 5). Crude extracts from cells that express the 

20 Repressor DBD-LEP fusion are electrophoresed through a denaturing gel and blotted 
onto a suitable surface, such a nitrocellulose. The blot is then probed with the GST- 
epitope fusion protein, followed by anti-GST antibody and finally an HRP-conjugated 
secondary to visualize the binding events. Lanes in which the band corresponding to the 
Repressor DBD-LEP is bound by the GST-epitope fusion protein are considered true hits. 

25 Another method of utility is affinity chromatography. In this case, the target 

peptide is coupled to a chromatography resin, either as a synthetic peptide or as part of a 
fusion protein. An extract containing the LEP, fused to the Repressor DBD or some 
other protein, is then passed over the column. The extent of binding of the LEP- 
containing protein to the resin is then deduced by some convenient method such as SDS- 

30 PAGE after elution of all bound proteins from the column (see FIGS. 2 and 3). Peptides 
capable of binding selectively to the target epitope in this format are considered true 
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positives and excellent candidates for further development as affinity chromatography 
agents. 

Finally, the most rigorous tests for peptide-peptide interaction involve 
spectroscopic assays. These experiments provide quantitative data regarding the affinity 
5 of the LEP for the epitope target. For example, fluorescence polarization (Heyduk et al., 
1996) can be employed. In this experiment, one of the peptides is synthesized with a 
covalently coupled fluorescein molecule. A low concentration of this species (usually 10" 

O I A 

M to 10" M) is titrated with increasing amounts of a fusion protein containing the 
partner peptide. The increase in the polarization of the fluorescence as a result of binding 
10 of the low molecular weight fluorescein-peptide conjugate to the high molecular weight 
fusion protein is monitored. 

4. List of Suitable Targets 

An advantage of this technology is that it potentially allows the isolation of a 
15 peptide capable of binding almost any epitope. However, we are interested, in particular 
in protein epitopes that contain the sites of post-translational modifications that regulate 
the activity of the protein. This is because the EBM could potentially be employed as a 
reagent to control these events (see below for proof of principle). Some particularly 
important targets would be: (1) the epitopes contained within pro-interleukins that are 
20 cleaved by the corresponding maturases to provide the biologically active interleukin; (2) 
epitopes in transcription factors, such a Ik-B, NF-AT and SREBP that include residues 
that are either cut or phosphorylated to active their transcription potential, (3) epitopes in 
the extracellular oligomerization domains of E-cadherins (there is some evidence that 
molecules able to bind these sequences could promote movement of drugs across the 
25 blood-brain barrier); and (4) epitopes at the far N- or C-termini of any protein, which are 
expected to be available sites of EBMs to be employed as affinity chromatography 
agents. 

30 
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C. Generation of Vectors 

1 . Design of Peptide Targets 

Generally, there are few, if any, constraints on the design of potential peptide 
targets. Perhaps the only considerations are the generally considered minimum epitope 
5 size for a peptide - about 5-6 residues - and the statistical uniqueness in the human 
genome of about 8 residues. 

2. Oligonucleotide Synthesis 

Oligonucleotide synthesis is performed according to standard methods. See, for 
10 example, Itakura and Riggs (1980). Additionally, U.S. Patent 4,704,362; U.S. Patent 
5,221,619, U.S. Patent 5,583,013 each describe various methods of preparing synthetic 
structural genes. 

Oligonucleotide synthesis is well known to those of skill in the art. Various 
different mechanisms of oligonucleotide synthesis have been disclosed in for example, 
15 U.S. Patents 4,659,774, 4,816,571, 5,141,813, 5,264,566, 4,959,463, 5,428,148, 
5,554,744, 5,574,146, 5,602,244, each of which is incorporated herein by reference. 

Basically, chemical synthesis can be achieved by the diester method, the triester 
method polynucleotides phosphorylase method and by solid-phase chemistry. These 
methods are discussed in further detail below. 

20 

i) Diester method 

The diester method was the first to be developed to a usable state, primarily by 
Khorana and co-workers. (Khorana, 1979). The basic step is the joining of two suitably 
protected deoxynucleotides to form a dideoxynucleotide containing a phosphodiester 
25 bond. The diester method is well established and has been used to synthesize DNA 
molecules (Khorana, 1979). 

ii) Triester method 

The main difference between the diester and triester methods is the presence in 
30 the latter of an extra protecting group on the phosphate atoms of the reactants and 
products (Itakura et aL, 1975). The phosphate protecting group is usually a chlorophenyl 
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group, which renders the nucleotides and polynucleotide intermediates soluble in organic 
solvents. Therefore purification's are done in chloroform solutions. Other improvements 
in the method include (i) the block coupling of trimers and larger oligomers, (ii) the 
extensive use of high-performance liquid chromatography for the purification of both 
5 intermediate and final products, and (iii) solid-phase synthesis. 

iii) Polynucleotide phosphorylase method 

This is an enzymatic method of DNA synthesis that can be used to synthesize 
many useful oligodeoxynucleotides (Gillam et al. 9 1978; Gillam et al, 1979). Under 

10 controlled conditions, polynucleotide phosphorylase adds predominantly a single 
nucleotide to a short oligodeoxynucleotide. Chromatographic purification allows the 
desired single adduct to be obtained. At least a trimer is required to start the procedure, 
and this primer must be obtained by some other method. The polynucleotide 
phosphorylase method works and has the advantage that the procedures involved are 

1 5 familiar to most biochemists. 

iv) Solid-phase methods 

Drawing on the technology developed for the solid-phase synthesis of 
polypeptides, it has been possible to attach the initial nucleotide to solid support material 

20 and proceed with the stepwise addition of nucleotides. All mixing and washing steps are 
simplified, and the procedure becomes amenable to automation. These syntheses are now 
routinely carried out using automatic DNA synthesizers. 

Phosphoramidite chemistry (Beaucage, and Lyer, 1992) has become by far the 
most widely used coupling chemistry for the synthesis of oligonucleotides. As is well 

25 known to those skilled in the art, phosphoramidite synthesis of oligonucleotides involves 
activation of nucleoside phosphoramidite monomer precursors by reaction with an 
activating agent to form activated intermediates, followed by sequential addition of the 
activated intermediates to the growing oligonucleotide chain (generally anchored at one 
end to a suitable solid support) to form the oligonucleotide product. 

30 
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3. Library Synthesis 

Peptide libraries are constructed at the DNA level by one of two methods. The 
first is to fragment genomic DNA, either by use of a four base cutter restriction enzyme, 
or by physical shearing, to provide a large number of small fragments that are then cloned 
5 downstream of the Repressor DNA-binding domain. The second is to encode a library of 
a specific length with synthetic oligonucleotides. This is standard practice in the field. A 
single-stranded oligonucleotide is synthesized with constant ends that include the 
appropriate restriction sites to allow subsequent cloning into the vector. The central part 
of the oligonucleotide is made random by programming the synthesizer to insert some 

10 mixture of all four natural nucleotides at each position. This can be adjusted to suppress 
stop codons if desired by restricting the identity of the third nucleotide in each codon. 
The resulting family of molecules is made double stranded by annealing a primer to the 
constant region on one end and replicating the rest of the molecule with a DNA 
polymerase. The ends are then cut with the appropriate restriction enzyme and the 

15 double-stranded DNAs are ligated into the "library vector." The entire collection of 
ligation products is then transformed into E. coli, providing the library. This type of 
protocol is standard in the field and well known to experts in the area such as the 
inventor. Examples of this procedure can be found in the following references (Cwirla et 
al., 1990; Leuther et al., 1993; Needels et al., 1993). 

20 

4. Expression Constructs 

The term "expression construct" refers to any type of genetic construct 
comprising a nucleic acid coding for a RNA capable of being transcribed. In some cases, 
RNA molecules are then translated into a protein, polypeptide, or peptide. In other cases, 

25 these sequences are not translated, for example, in the production of antisense molecules 
or ribozymes. Expression constructs can contain a variety of "control sequences," which 
refer to nucleic acid sequences necessary for the transcription and possibly translation of 
an operably linked coding sequence in a particular host cell. In addition to control 
sequences that govern transcription and translation, vectors and expression constructs 

30 may contain nucleic acid sequences that serve other functions as well and are described 
infra, 
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i) Promoters and Enhancers 
A "promoter" is a control sequence that is a region of a nucleic acid sequence at 
which initiation and rate of transcription are controlled. It may contain genetic elements 
5 at which regulatory proteins and molecules may bind, such as RNA polymerase and other 
transcription factors, to initiate the specific transcription a nucleic acid sequence. The 
phrases "operatively positioned," "operatively linked," "under control," and "under 
transcriptional control" mean that a promoter is in a correct functional location and/or 
orientation in relation to a nucleic acid sequence to control transcriptional initiation 

1 0 and/or expression of that sequence. 

A promoter generally comprises a sequence that functions to position the start site 
for RNA synthesis. The best known example of this is the TATA box, but in some 
promoters lacking a TATA box, such as, for example, the promoter for the mammalian 
terminal deoxynucleotidyl transferase gene and the promoter for the SV40 late genes, a 

15 discrete element overlying the start site itself helps to fix the place of initiation. 
Additional promoter elements regulate the frequency of transcriptional initiation. 
Typically, these are located in the region 30-110 bp upstream of the start site, although a 
number of promoters have been shown to contain functional elements downstream of the 
start site as well. To bring a coding sequence "under the control of a promoter, one 

20 positions the 5' end of the transcription initiation site of the transcriptional reading frame 
"downstream" of (i.e., 3' of) the chosen promoter. The "upstream" promoter stimulates 
transcription of the DNA and promotes expression of the encoded RNA. 

The spacing between promoter elements frequently is flexible, so that promoter 
function is preserved when elements are inverted or moved relative to one another. In the 

25 tk promoter, the spacing between promoter elements can be increased to 50 bp apart 
before activity begins to decline. Depending on the promoter, it appears that individual 
elements can function either cooperatively or independently to activate transcription. A 
promoter may or may not be used in conjunction with an "enhancer," which refers to a 
cis-acting regulatory sequence involved in the transcriptional activation of a nucleic acid 

30 sequence. 
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A promoter may be one naturally associated with a nucleic acid sequence, as may 
be obtained by isolating the 5' non-coding sequences located upstream of the coding 
segment and/or exon. Such a promoter can be referred to as "endogenous." Similarly, an 
enhancer may be one naturally associated with a nucleic acid sequence, located either 
5 downstream or upstream of that sequence. Alternatively, certain advantages will be 
gained by positioning the coding nucleic acid segment under the control of a recombinant 
or heterologous promoter, which refers to a promoter that is not normally associated with 
a nucleic acid sequence in its natural environment. A recombinant or heterologous 
enhancer refers also to an enhancer not normally associated with a nucleic acid sequence 

10 in its natural environment. Such promoters or enhancers may include promoters or 
enhancers of other genes, and promoters or enhancers isolated from any other virus, or 
prokaryotic or eukaryotic cell, and promoters or enhancers not "naturally occurring," i.e., 
containing different elements of different transcriptional regulatory regions, and/or 
mutations that alter expression. For example, promoters that are most commonly used in 

15 recombinant DNA construction include the (3-lactamase (penicillinase), lactose and 
tryptophan (trp) promoter systems. In addition to producing nucleic acid sequences of 
promoters and enhancers synthetically, sequences may be produced using recombinant 
cloning and/or nucleic acid amplification technology, including PCR™, in connection 
with the compositions disclosed herein (see U.S. Patents 4,683,202 and 5,928,906, each 

20 incorporated herein by reference). Furthermore, it is contemplated the control sequences 
that direct transcription and/or expression of sequences within non-nuclear organelles 
such as mitochondria, chloroplasts, and the like, can be employed as well. 

Naturally, it will be important to employ a promoter and/or enhancer that 
effectively directs the expression of the DNA segment in the organelle, cell type, tissue, 

25 organ, or organism chosen for expression. Those of skill in the art of molecular biology 
generally know the use of promoters, enhancers, and cell type combinations for protein 
expression, (see, for example Sambrook etal 1989, incorporated herein by reference). 
The promoters employed may be constitutive, tissue-specific, inducible, and/or useful 
under the appropriate conditions to direct high level expression of the introduced DNA 

30 segment, such as is advantageous in the large-scale production of recombinant proteins 
and/or peptides. The promoter may be heterologous or endogenous. 
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Additionally any promoter/enhancer combination (as per, for example, the 
Eukaryotic Promoter Data Base EPDB, http://www.epd.isb-sib.ch/) could also be used to 
drive expression. Use of a T3, T7 or SP6 cytoplasmic expression system is another 
possible embodiment. 

5 

ii) Other Regulatory Elements 
A specific initiation signal also may be required for efficient translation of coding 
sequences. These signals include the ATG initiation codon or adjacent sequences. 
Exogenous translational control signals, including the ATG initiation codon, may need to 

1 0 be provided. One of ordinary skill in the art would readily be capable of determining this 
and providing the necessary signals. It is well known that the initiation codon must be 
"in-frame" with the reading frame of the desired coding sequence to ensure translation of 
the entire insert. The exogenous translational control signals and initiation codons can be 
either natural or synthetic. The efficiency of expression may be enhanced by the 

1 5 inclusion of appropriate transcription enhancer elements. 

In certain embodiments of the invention, the use of internal ribosome entry sites 
(IRES) elements are used to create multigene, or polycistronic, messages. IRES elements 
are able to bypass the ribosome scanning model of S'-methylated Cap dependent 
translation and begin translation at internal sites (Pelletier & Sonenberg, 1988). IRES 

20 elements from two members of the picornavirus family (polio and encephalomyocarditis) 
have been described (Pelletier & Sonenberg, 1988), as well an IRES from a mammalian 
message (Macejak & Sarnow, 1991). IRES elements can be linked to heterologous open 
reading frames. Multiple open reading frames can be transcribed together, each separated 
by an IRES, creating polycistronic messages. By virtue of the IRES element, each open 

25 reading frame is accessible to ribosomes for efficient translation. Multiple genes can be 
efficiently expressed using a single promoter/enhancer to transcribe a single message (see 
U.S. Patent 5,925,565 and 5,935,819, each herein incorporated by reference). 

Most transcribed eukaryotic RNA molecules will undergo RNA splicing to 
remove introns from the primary transcripts. Vectors containing genomic eukaryotic 

30 sequences may require donor and/or acceptor splicing sites to ensure proper processing of 
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the transcript for protein expression (see, for example, Chandler et ah, 1997; herein 
incorporated by reference). 

The vectors or constructs of the present invention will generally comprise at least 
one termination signal. A "termination signal 1 ' or "terminator" is comprised of the DNA 
5 sequences involved in specific termination of an RNA transcript by an RNA polymerase. 
Thus, in certain embodiments a termination signal that ends the production of an RNA 
transcript is contemplated. A terminator may be necessary in vivo to achieve desirable 
message levels. 

In eukaryotic systems, the terminator region may also comprise specific DNA 

10 sequences that permit site-specific cleavage of the new transcript so as to expose a 
polyadenylation site. This signals a specialized endogenous polymerase to add a stretch 
of about 200 A residues (poly A) to the 3 '-end of the transcript. RNA molecules modified 
with this polyA tail appear to more stable and are translated more efficiently. Thus, in 
other embodiments involving eukaryotes, it is preferred that that terminator comprises a 

15 signal for the cleavage of the RNA, and it is more preferred that the terminator signal 
promotes polyadenylation of the message. The terminator and/or polyadenylation site 
elements can serve to enhance message levels and to minimize read through from the 
cassette into other sequences. 

Terminators contemplated for use in the invention include any known terminator 

20 of transcription described herein or known to one of ordinary skill in the art, including 
but not limited to, for example, the termination sequences of genes, such as for example 
the bovine growth hormone terminator or viral termination sequences, such as for 
example the SV40 terminator. In certain embodiments, the termination signal may be a 
lack of transcribable or translatable sequence, such as due to a sequence truncation. 

25 In expression, particularly eukaryotic expression, one will typically include a 

polyadenylation signal to effect proper polyadenylation of the transcript. The nature of 
the polyadenylation signal is not believed to be crucial to the successful practice of the 
invention, and any such sequence may be employed. Preferred embodiments include the 
SV40 polyadenylation signal or the bovine growth hormone polyadenylation signal, 

30 convenient and known to function well in various target cells. Polyadenylation may 
increase the stability of the transcript or may facilitate cytoplasmic transport. 
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In order to propagate a vector in a host cell, it may contain one or more origins of 
replication sites (often termed "ori"), which is a specific nucleic acid sequence at which 
replication is initiated. Alternatively an autonomously replicating sequence (ARS) can be 
employed if the host cell is yeast. 

5 

iii) Vectors 

The term "vector" is used to refer to a carrier nucleic acid molecule into which a 
nucleic acid sequence can be inserted for introduction into a cell. Typcially, the term 
vector also connotes the ability of the molecule to be replicated. A nucleic acid sequence 

10 can be "exogenous," which means that it is foreign to the cell into which the vector is 
being introduced or that the sequence is homologous to a sequence in the cell but in a 
position within the host cell nucleic acid in which the sequence is ordinarily not found. 
Vectors include plasmids, cosmids, viruses (bacteriophage, animal viruses, and plant 
viruses), and artificial chromosomes (e.g., YACs). One of skill in the art would be well 

15 equipped to construct a vector through standard recombinant techniques (see, for 
example, Maniatis et aL, 1988 and Ausubel et aL, 1994, both incorporated herein by 
reference). 

In certain embodiments, a plasmid vector is contemplated for use to transform a 
host cell In general, plasmid vectors containing replicon and control sequences which 

20 are derived from species compatible with the host cell are used in connection with these 
hosts. The vector ordinarily carries a replication site, as well as marking sequences 
which are capable of providing phenotypic selection in transformed cells. In a non- 
limiting example, E. coli is often transformed using derivatives of pBR322, a plasmid 
derived from an E. coli species. pBR322 contains genes for ampicillin and tetracycline 

25 resistance and thus provides easy means for identifying transformed cells. The pBR 
plasmid, or other microbial plasmid or phage must also contain, or be modified to 
contain, for example, promoters which can be used by the microbial organism for 
expression of its own proteins. 

In addition, phage vectors containing replicon and control sequences that are 

30 compatible with the host microorganism can be used as transforming vectors in 
connection with these hosts. For example, the phage lambda GEM™-1 1 may be utilized 
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in making a recombinant phage vector which can be used to transform host cells, such as, 
for example, E. coli LE392. 

Further useful plasmid vectors include pIN vectors (Inouye^ al, 1985); and 
pGEX vectors, for use in generating glutathione S-transferase (GST) soluble fusion 
5 proteins for later purification and separation or cleavage. Other suitable fusion proteins 
are those with P-galactosidase, ubiquitin, and the like. 

Bacterial host cells, for example, E. coli, comprising the expression vector, are 
grown in any of a number of suitable media, for example, LB. The expression of the 
recombinant protein in certain vectors may be induced, as would be understood by those 
10 of skill in the art, by contacting a host cell with an agent specific for certain promoters, 
e.g., by adding EPTG to the media or by switching incubation to a higher temperature. 
After culturing the bacteria for a further period, generally of between 2 and 24 h, the cells 
are collected by centrifugation and washed to remove residual media. 

15 iv) Cloning Techniques 

Various standard cloning techniques will be utilized according to the present 
invention, including restriction, ligation, transformation, amplification, Southern and 
Northern blotting, etc. A comprehensive reference for these techniques is Sambrook et 
al. (1989). 

20 

v) Gene Transfer Techniques 
Suitable methods for nucleic acid delivery for transformation of a cell for use with 
the current invention are believed to include virtually any method as would be known to 
one of ordinary skill in the art. Such methods include microinjection (Harlan and 

25 Weintraub, 1985; U.S. Patent 5,789,215, incorporated herein by reference); by 
electroporation (U.S. Patent 5,384,253, incorporated herein by reference; Tur-Kaspa et 
al, 1986; Potter et al, 1984); by calcium phosphate precipitation (Graham and Van Der 
Eb, 1973; Chen and Okayama, 1987; Ripped al, 1990); by using DEAE-dextran 
followed by polyethylene glycol (Gopal, 1985); by direct sonic loading 

30 (Fechheimer etaL, 1987); by liposome mediated transfection (Nicolau and Sene, 1982; 
Fraley et al, 1979; Nicolau et al, 1987; Wong et al, 1980; Kaneda et al, 1989; 
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Kato etal, 1991) and receptor-mediated transfection (Wu and Wu, 1987; Wu and Wu, 
1988); by microprojectile bombardment (PCT Application Nos. WO 94/09699 and 
95/06128; U.S. Patents 5,610,042; 5,322,783 5,563,055, 5,550,318, 5,538,877 and 
5,538,880, and each incorporated herein by reference); by agitation with silicon carbide 
5 fibers (Kaeppler et al, 1990; U.S. Patents 5,302,523 and 5,464,765, each incorporated 
herein by reference); by Agrobacteriurn-mediated transformation (U.S. Patents 5,591,616 
and 5,563,055, each incorporated herein by reference); by PEG-mediated transformation 
of protoplasts (Qmirulleh era/., 1993; U.S. Patents 4,684,611 and 4,952,500, each 
incorporated herein by reference); by desiccation/inhibition-mediated DNA uptake 
10 (Potrykus et al, 1985), and any combination of such methods. Through the application 
of techniques such as these cells may be stably or transiently transformed. 



a. Direct Injection 
In certain embodiments, a nucleic acid may be delivered to a cell via direct 
15 microinjection. Direct microinjection has been used to introduce nucleic acid constructs 
into Xenopus oocytes (Harland and Weintraub, 1985). 



b. Electroporation 
In certain embodiments of the present invention, a nucleic acid is introduced into 
20 an organelle, a cell, a tissue or an organism via electroporation. Electroporation involves 
the exposure of a suspension of cells and DNA to a high-voltage electric discharge. In 
some variants of this method, certain cell wall-degrading enzymes, such as 
pectin-degrading enzymes, are employed to render the target recipient cells more 
susceptible to transformation by electroporation than untreated cells (U.S. Patent 
25 5,384,253, incorporated herein by reference). Alternatively, recipient cells can be made 
more susceptible to transformation by mechanical wounding. 

Transfection of eukaryotic cells using electroporation has been quite successful. 
Mouse pre-B lymphocytes have been transfected with human kappa-immunoglobulin 
genes (Potter et al, 1984), and rat hepatocytes have been transfected with the 
30 chloramphenicol acetyltransferase gene (Tur-Kaspa et al., 1986) in this manner. 
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To effect transformation by electroporation in cells such as, for example, plant 
cells, one may employ either friable tissues, such as a suspension culture of cells or 
embryogenic callus or alternatively one may transform immature embryos or other 
organized tissue directly. In this technique, one would partially degrade the cell walls of 
5 the chosen cells by exposing them to pectin-degrading enzymes (pectolyases) or 
mechanically wounding in a controlled manner. Examples of some species which have 
been transformed by electroporation of intact cells include maize (U.S. Patent No. 
5,384,253; Rhodes et al, 1995; D'Halluin et al, 1992), wheat (Zhou et al, 1993), tomato 
(Hou and Lin, 1996), soybean (Christou et al, 1987) and tobacco (Lee et al, 1989). 

10 One also may employ protoplasts for electroporation transformation of plant cells 

(Bates, 1994; Lazzeri, 1995). For example, the generation of transgenic soybean plants 
by electroporation of cotyledon-derived protoplasts is described by Dhir and Widholm in 
International Patent Application No. WO 9217598, incorporated herein by reference. 
Other examples of species for which protoplast transformation has been described 

15 include barley (Lazerri, 1995), sorghum (Battraw et al t 1991), maize 
(Bhattacharjee et al, 1997), wheat (He et al, 1994) and tomato (Tsukada, 1989). 

c. Calcium Phosphate 
In other embodiments of the present invention, a nucleic acid is introduced to the 
20 cells using calcium phosphate precipitation. Human KB cells have been transfected with 
adenovirus 5 DNA (Graham and Van Der Eb, 1973) using this technique. Also in this 
manner, mouse L(A9), mouse CI 27, CHO, CV-1, BHK, NIH3T3 and HeLa cells were 
transfected with a neomycin marker gene (Chen and Okayama, 1987), and rat 
hepatocytes were transfected with a variety of marker genes (Rippe et al, 1990). 



25 



d. DEAE-Dextran 
In another embodiment, a nucleic acid is delivered into a cell using 
DEAE-dextran followed by polyethylene glycol. In this manner, reporter plasmids were 
introduced into mouse myeloma and erythroleukemia cells (Gopal, 1985). 



30 
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e. Sonication Loading 

Additional embodiments of the present invention include the introduction of a 
nucleic acid by direct sonic loading. LTK" fibroblasts have been transfected with the 
thymidine kinase gene by sonication loading (Fechheimer et aL, 1987), 

5 

f. Liposome-Mediated Transfection 

In a further embodiment of the invention, a nucleic acid may be entrapped in a 
lipid complex such as, for example, a liposome. Liposomes are vesicular structures 
characterized by a phospholipid bilayer membrane and an inner aqueous medium. 

10 Multilamellar liposomes have multiple lipid layers separated by aqueous medium. They 
form spontaneously when phospholipids are suspended in an excess of aqueous solution. 
The lipid components undergo self-rearrangement before the formation of closed 
structures and entrap water and dissolved solutes between the lipid bilayers (Ghosh and 
Bachhawat, 1991). Also contemplated is an nucleic acid complexed with Lipofectamine 

1 5 (Gibco BRL) or Superfect (Qiagen). 

Liposome-mediated nucleic acid delivery and expression of foreign DNA in vitro 
has been very successful (Nicolau and Sene, 1982; Fraley et al, 1979; 
Nicolau et aL, 1987). The feasibility of liposome-mediated delivery and expression of 
foreign DNA in cultured chick embryo, HeLa and hepatoma cells has also been 

20 demonstrated (Wong et al, 1980). 

In certain embodiments of the invention, a liposome may be complexed with a 
hemagglutinating virus (HVJ). This has been shown to facilitate fusion with the cell 
membrane and promote cell entry of liposome-encapsulated DNA (KanedaeM/., 1989). 
In other embodiments, a liposome may be complexed or employed in conjunction with 

25 nuclear non-histone chromosomal proteins (HMG-1) (Kato etal. } 1991). In yet further 
embodiments, a liposome may be complexed or employed in conjunction with both HVJ 
and HMG-1. In other embodiments, a delivery vehicle may comprise a ligand and a 
liposome. 
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g. Receptor Mediated Transfection 
Still further, a nucleic acid may be delivered to a target cell via receptor-mediated 
delivery vehicles. These take advantage of the selective uptake of macromolecules by 
receptor-mediated endocytosis that will be occurring in a target cell. In view of the cell 
5 type-specific distribution of various receptors, this delivery method adds another degree 
of specificity to the present invention. 

Certain receptor-mediated gene targeting vehicles comprise a cell 
receptor-specific ligand and a nucleic acid-binding agent. Others comprise a cell 
receptor-specific ligand to which the nucleic acid to be delivered has been operatively 
10 attached. Several ligands have been used for receptor-mediated gene transfer (Wu and 
Wu, 1987; Wagners al, 1990; Perales et aL, 1994; Myers, EPO 0273085), which 
establishes the operability of the technique. Specific delivery in the context of another 
mammalian cell type has been described (Wu and Wu, 1993; incorporated herein by 
reference). In certain aspects of the present invention, a ligand will be chosen to 
1 5 correspond to a receptor specifically expressed on the target cell population. 

In other embodiments, a nucleic acid delivery vehicle component of a 
cell-specific nucleic acid targeting vehicle may comprise a specific binding ligand in 
combination with a liposome. The nucleic acid(s) to be delivered are housed within the 
liposome and the specific binding ligand is functionally incorporated into the liposome 
20 membrane. The liposome will thus specifically bind to the receptor(s) of a target cell and 
deliver the contents to a cell, Such systems have been shown to be functional using 
systems in which, for example, epidermal growth factor (EGF) is used in the 
receptor-mediated delivery of a nucleic acid to cells that exhibit upregulation of the EGF 
receptor. 

25 In still further embodiments, the nucleic acid delivery vehicle component of a 

targeted delivery vehicle may be a liposome itself, which will preferably comprise one or 
more lipids or glycoproteins that direct cell-specific binding. For example, 
lactosyl-ceramide, a galactose-terminal asialganglioside, have been incorporated into 
liposomes and observed an increase in the uptake of the insulin gene by hepatocytes 

30 (Nicolau et aL, 1987). It is contemplated that the tissue-specific transforming constructs 
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of the present invention can be specifically delivered into a target cell in a similar 
manner. 

h. Microprojectile Bombardment 
5 Microprojectile bombardment techniques can be used to introduce a nucleic acid 

into at least one, organelle, cell, tissue or organism (U.S. Patent 5,550,318; U.S. Patent 
5,538,880; U.S. Patent 5,610,042; and PCT Application WO 94/09699; each of which is 
incorporated herein by reference). This method depends on the ability to accelerate 
DNA-coated microprojectiles to a high velocity allowing them to pierce cell membranes 
10 and enter cells without killing them (Klein etal, 1987). There are a wide variety of 
microprojectile bombardment techniques known in the art, many of which are applicable 
to the invention. 

Microprojectile bombardment may be used to transform various cell(s), tissue(s) 
or organism(s), such as for example any plant species. Examples of species which have 

15 been transformed by microprojectile bombardment include monocot species such as 
maize (PCT Application WO 95/06128), barley (Ritala <?* a/., 1994; 
Hensgens etal, 1993), wheat (U.S. Patent 5,563,055, incorporated herein by reference), 
rice (Hensgens et al, 1993), oat (Torbet et al, 1995; Torbet et al, 1998), rye 
(Hensgens et al, 1993), sugarcane (Bower et al, 1992), and sorghum (Casas et al, 1993; 

20 Hagio et al, 1991); as well as a number of dicots including tobacco (Tomes et al, 1990; 
Buising and Benbow, 1994), soybean (U.S. Patent 5,322,783, incorporated herein by 
reference), sunflower (Knittel a/. 1994), peanut (Singsit et al, 1997), cotton (McCabe 
and Martinell, 1993), tomato (VanEck etal 1995), and legumes in general (U.S. Patent 
5,563,055, incorporated herein by reference). 

25 In this microprojectile bombardment, one or more particles may be coated with at 

least one nucleic acid and delivered into cells by a propelling force. Several devices for 
accelerating small particles have been developed. One such device relies on a high 
voltage discharge to generate an electrical current, which in turn provides the motive 
force (Yang etal, 1990). The microprojectiles used have consisted of biologically inert 

30 substances such as tungsten or gold particles or beads. Exemplary particles include those 
comprised of tungsten, platinum, and preferably, gold. It is contemplated that in some 
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instances DNA precipitation onto metal particles would not be necessary for DNA 
delivery to a recipient cell using microprojectile bombardment. However, it is 
contemplated that particles may contain DNA rather than be coated with DNA. 
DNA-coated particles may increase the level of DNA delivery via particle bombardment 
5 but are not, in and of themselves, necessary. 

For the bombardment, cells in suspension are concentrated on filters or solid 
culture medium. Alternatively, immature embryos or other target cells may be arranged 
on solid culture medium. The cells to be bombarded are positioned at an appropriate 
distance below the macroprojectile stopping plate. 

10 An illustrative embodiment of a method for delivering DNA into a cell (e.g., a 

plant cell) by acceleration is the Biolistics Particle Delivery System, which can be used to 
propel particles coated with DNA or cells through a screen, such as a stainless steel or 
Nytex screen, onto a filter surface covered with cells, such as for example, a monocot 
plant cells cultured in suspension. The screen disperses the particles so that they are not 

15 delivered to the recipient cells in large aggregates. It is believed that a screen intervening 
between the projectile apparatus and the cells to be bombarded reduces the size of 
projectiles aggregate and may contribute to a higher frequency of transformation by 
reducing the damage inflicted on the recipient cells by projectiles that are too large. 

20 D. Separation Technology 

1 . Chromatography 

Any of a wide variety of chromatographic procedures may be employed according 
to the present invention. For example, thin layer chromatography, gas chromatography, 
high performance liquid chromatography, paper chromatography, affinity 

25 chromatography or supercritical flow chromatography may be used to effect separation of 
various chemical species. 

Partition chromatography is based on the theory that if two phases are in contact 
with one another, and if one or both phases constitute a solute, the solute will distribute 
itself between the two phases. Usually, partition chromatography employs a column, 

30 which is filled with a sorbent and a solvent. The solution containing the solute is layered 
on top of the column. The solvent is then passed through the column, continuously, 
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which permits movement of the solute through the column material. The solute can then 
be collected based on its movement rate. The two most common types of partition 
chromatograph are paper chromatograph and thin-layer chromatograph (TLC); together 
these are called adsorption chromatography. In both cases, the matrix contains a bound 
5 liquid. Other examples of partition chromatography are gas-liquid and gel 
chromatography. 

Paper chromatography is a variant of partition chromatography that is performed 
on cellulose columns in the form of a paper sheet. Cellulose contains a large amount of 
bound water even when extensively dried. Partitioning occurs between the bound water 

10 and the developing solvent. Frequently, the solvent used is water. Usually, very small 
volumes of the solution mixture to be separated is placed at top of the paper and allowed 
to dry. Capillarity draws the solvent through the paper, dissolves the sample, and moves 
the components in the direction of flow. Paper chromatograms may be developed for 
either ascending or descending solvent flow. Two dimensional separations are permitted 

15 by changing the axis of migration 90° after the first run.. 

Thin layer chromatography (TLC) is very commonly used to separate lipids and, 
therefore, is considered a preferred embodiment of the present invention. TLC has the 
advantages of paper chromatography, but allows the use of any substance that can be 
finely divided and formed into a uniform layer. In TLC, the stationary phase is a layer of 

20 sorbent spread uniformly over the surface of a glass or plastic plate. The plates are 
usually made by forming a slurry of sorbent that is poured onto the surface of the gel 
after creating a well by placing tape at a selected height along the perimeter of the plate. 
After the sorbent dries, the tape is removed and the plate is treated just as paper in paper 
chromatography. The sample is applied and the plate is contacted with a solvent. Once 

25 the solvent has almost reached the end of the plate, the plate is removed and dried. Spots 
can then be identified by fluorescence, immunologic identification, counting of 
radioactivity, or by spraying varying reagents onto the surface to produce a color change. 

In gas-liquid chromatography (GLC), the mobile phase is a gas and the stationary 
phase is a liquid adsorbed either to the inner surface of a tube or column or to a solid 

30 support. The liquid usually is applied as a solid dissolved in a volatile solvent such as 
ether. The sample, which may be any sample that can be volatized, is introduced as a 
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liquid with an inert gas, such as helium, argon or nitrogen, and then heated. This gaseous 
mixture passes through the tubing. The vaporized compounds continually redistribute 
themselves between the gaseous mobile phase and the liquid stationary phase, according 
to their partition coefficients. 
5 The advantage of GLC is in the separation of small molecules. Sensitivity and 

speed are quite good, with speeds that approach 1000 times that of standard liquid 
chromatography. By using a non-destructive detector, GLC can be used preparatively to 
purify grams quantities of material. The principal use of GLC has been in the separation 
of alcohols, esters, fatty acids and amines. 

10 Gel chromatography, or molecular sieve chromatography, is a special type of 

partition chromatography that is based on molecular size, The theory behind gel 
chromatography is that the column, which is prepared with tiny particles of an inert 
substance that contain small pores, separates larger molecules from smaller molecules as 
they pass through or around the pores, depending on their size. As long as the material of 

15 which the particles are made does not adsorb the molecules, the sole factor determining 
rate of flow is the size. Hence, molecules are eluted from the column in decreasing size, 
so long as the shape is relatively constant. Gel chromatography is unsurpassed for 
separating molecules of different size because separation is independent of all other 
factors such as pH, ionic strength, temperature, etc. There also is virtually no adsorption, 

20 less zone spreading and the elution volume is related in a simple matter to molecular 
weight. 

The gel material for gel chromatography is a three-dimensional network whose 
structure is usually random. The gels consist of cross-linked polymers that are generally 
inert, do not bind or react with the material being analyzed, and are uncharged. The 
25 space filled within the gel is filled with liquid and this liquid occupies most of the gel 
volume. Common gels are dextran, agarose and polyacrylamide; they are used for 
aqueous solution. 

High Performance Liquid Chromatography (HPLC) is characterized by a very 
rapid separation with extraordinary resolution of peaks. This is achieved by the use of 
30 very fine particles and high pressure to maintain and adequate flow rate. Separation can 
be accomplished in a matter of minutes, or a most an hour. Moreover, only a very small 
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volume of the sample is needed because the particles are so small and close-packed that 
the void volume is a very small fraction of the bed volume. Also, the concentration of 
the sample need not be very great because the bands are so narrow that there is very little 
dilution of the sample. 

Affinity Chromatography is a chromatographic procedure that relies on the 
specific affinity between a substance to be isolated and a molecule that it can specifically 
bind to. This is a receptor-ligand type interaction. The column material is synthesized by 
covalently coupling one of the binding partners to an insoluble matrix. The column 
material is then able to specifically adsorb the substance from the solution. Elution 
occurs by changing the conditions to those in which binding will not occur (alter pH, 
ionic strength, temperature, etc.). 

The matrix should be a substance that itself does not adsorb molecules to any 
significant extent and that has a broad range of chemical, physical and thermal stability. 
The ligand should be coupled in such a way as to not affect its binding properties. The 
ligand should also provide relatively tight binding. And it should be possible to elute the 
substance without destroying the sample or the ligand. One of the most common forms 
of affinity chromatography is immunoaffinity chromatography. The generation of 
antibodies that would be suitable for use in accord with the present invention is discussed 
below. 

2. Electrophoresis 

Most electrophoresis embodiments will rely on polyacrylamide or agaraose media 
for separation. 

i) Protein 

Protein electrophoresis normally employs polyacrylamide resins, often using 
sodium dodecyl sulfate to facilitate size/charge separation. Other protein separations may 
be based on the isoelectric point of the protein. Protein gels also may be run under 
denaturing or non-denaturing conditions. Staining techniques such as Coomassie Stain 
and Silver Stain can be utilized on gel electrophoresed proteins. 
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ii) Nucleic Acids 
Preparative and analytical nucleic acid electrophoresis may employ either agarose 
or polyacrylamide for a separation matrix, with the former preferred for larger fragments. 
Staining is usually accomplished with ethidium bromide. In other embodiments, nucleic 
5 acids may be integrally labeled with dyes or radiolabels. 

E. Analytical Techniques 

1 . Western-blot & Far Western-blot 

Western blots involve the use of antibodies to identify proteins that have been 
10 separated electophoretically. Typically, the electrophoresis is PAGE, either denaturing or 
non-denaturing depending on the target. Following separation, the gel is then "blotted" 
on to a suitable 

2. Mass Spectroscopy 

15 While mass spectrometry-based characterization of multi-protein complexes is not 

a central focus of the invention, it is important to highlight at least one newly developed 
technique. Yates and coworkers (Link et al., 1999) have described a method for the 
identification of components of yeast multi-protein complexes that takes advantage of the 
fact that the sequence of the S. cerevisiae genome is known. A purified complex is 

20 denatured and subjected to a limit protease digest with trypsin or some other protease 
with a known sequence selectivity. The resultant complex mixture of peptides are then 
separated through two HPLC columns (a cation exchange followed by a reverse phase) 
linked sequentially and the eluate is introduced directly to a tandem mass spectrometer. 
These mass data are then transferred to a computer that is programmed with the masses 

25 of all possible yeast protein tryptic digestion products. The computer matches the 
experimental data to the data base and lists the proteins present in the original sample. 
More than 100 proteins can be uniquely identified in a single study using this completely 
hands-off technique. 

Since the human genome sequence will soon be known, this extremely powerful 
30 technique can be applied to human biology problems in the near future. The sample 
requirements for this technique are about 1 |ug of each individual protein detected. Thus, 
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for a complex comprised of 20 proteins, a not insignificant quantity of about 20 ng must 
be purified. The main point is that, to employ DALPC or other powerful mass 
spectrometry-based identification tools, preparative quantities of the complex of interest 
must be purified, not a few nanograms as are suitable for Western blotting. This will be 
5 facilitated greatly if one had in hand an EBM capable of binding the native multi-protein 
complex. Thus, the full implementation of DALPC and related techniques will require 
the present invention to rapidly and easily purify interesting complexes on this scale. 

3. NMR 

Nuclear magnetic resonance (NMR) is occurs when the nuclei of certain atoms 
10 are immersed in a static magnetic field and exposed to a second oscillating magnetic 
field. Some nuclei experience this phenomenon, and others do not, dependening upon 
whether they possess a property called spin. Spin is effectively a small magnetic field, 
and can cause the nucleus to produce an NMR signal. NMR spectroscopy involves the 
use of NMR to study physical, chemical, and biological properties of matter, determining 
15 the structure more proteins and protein complexes. Time domain NMR spectroscopic 
techniques are used to probe molecular dynamics in solutions. Solid state NMR 
spectroscopy is used to determine the molecular structure of solids. 

NMR samples generally are dissolved in a deuterium lock solvent, of which 
several are available. Most high resolution NMR samples are prepared and run in 5 mm 
20 glass NMR tubes. The animation window depicts a sample tube filled with solvent such 
that it fills the RF coil. The concentration of the sample should be great enough to give a 
good signal-to-noise ratio in your spectrum, yet minimize exchange effects found at high 
concentrations. The exact concentration will depend on the sensitivity of the 
spectrometer. 

25 

4. Quantification of Affinities & Measurement of Dissociation Constants 
The affinities of the isolated peptides for their epitope targets can be determined 

by a number of methods. These include fusion of one or both of the interacting peptides 
to heterologous proteins, followed by the binding of one of the fusions to an affinity 
30 resin. The affinity of the partner fusion protein for this resin is then determined. Another 
common technique is surface plasmon resonance, in which one of the interacting peptides 
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or fusion proteins is immobilized on an appropriately modified gold surface, and the on 
and off rates of the partner peptide are measured using the commercially available 
BIAcore instrument or an equivalent. The third method employed commonly by the 
inventor is fluorescence polarization, in which a fluorescently-labeled derivative of one 
5 of the peptides is titrated with the other (or the corresponding fusion protein) and the 
increase in the fluorescence polarization as a function of association is monitored. 

There are many other techniques that could be employed to measure the affinity 
of the selected peptides for the epitope or a protein containing the epitope. These are 
generally well known to an expert in the field such as the inventor. 

10 
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F. Methods to Improve Affinity of Low- to Moderate- Affinity Peptides 

1 . Co-oligomerization 

One approach to improving the affinity of certain peptides is through co- 
oligomerization. For example, acryloyl-functionalized peptides can be co-oligomerized 
5 with acrylamide if they contain an acryloyl group, for example at their N-terminus 
(O'Brian-Simpson et al., 1997). To effect the synthesis of an N-acryloyl peptide, lysines 
on the LEPs would be protected with the MenPoc group (an o-nitrobenzyl-containing 
alcohol or amine protecting group) (Fodor et al., 1991; Jacobs and Fodor, 1994; McGall 
et al., 1997)or some other substituent that will survive the conditions used to deprotect 

10 Fmoc groups and cleave the synthetic peptide from the chain. After the reaction of the 
peptides and an activated ester derivative of acrylic acid, the lysines will be exposed by 
photochemical removal of the MenPoc group (UV irradiation at 350 nm. The peptide 
would then be co-polymerized with acrylamide by the addition of APS and TEMED. 
Conditions will be adjusted such that oligomers of about 50 residues are produced with 

15 approximately 10% of these being LEPs, with rest of polymer comprised of acrylamide, 
which will act as spacer. This will obviously be a mixed population of oligomers. The 
hope will be that at least some of the population will have the appropriate LEPs spaced at 
an appropriate distance to be able to bind in a chelating fashion the multi-protein complex 
of interest. This type of two-point binding is expected to lead to a substantial increase in 

20 the affinity of the LEPs for the complex relative to a single LEP-epitope interaction. 

Two types of oligomers can be envisioned. One is where only a single LEP is 
included in the chain. These "homooligomers" would be anticipated to have an increased 
affinity from proteins containing the target epitope that exist as homodimers or higher 
order homooligomers. In the more general case, one can imagine co-polymerizing two 

25 different epitope-binding peptides that bind distinct epitopes in the same protein. These 
oligomers should be capable of tight two point binding to even monomeric proteins. 

2. Cross-linking 

The inventors are investigating a fundamentally different strategy to make the 
30 genetically selected peptides useful in various formats, including chip assays. As part of 
this project, the inventors have developed revolutionary new cross-linking chemistry that 
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is a 1000-fold more efficient than known methods REFS. The reaction involves 
photolysis of metal complexes in the presence of an electron acceptor. This generates a 
high-valent metal complex that abstracts an electron from a protein tyrosine or tryptophan 
side chain. The resultant protein radical then couples rapidly with a nearby nucleophilic 
group (see FIG. 6). If that group is provided by a partner protein, an irreversible cross- 
link results. If the inventors were to include a tyrosine in the LEP (if it does not already 
contain one), it seems likely that the inventors could trap the non-covalent peptide- 
epitope complex covalently. The inventor has demonstrated that this reaction works well 
in crude extracts, so such a protocol could be carried out without the need for prior 
purification of the target protein. This should be particularly efficient when the epitope is 
in the context of the intact protein and the tyrosyl radical has a large surface on which it 
can couple. Since the peptide-epitope interaction is so specific, the inventor anticipates 
that this strategy can be employed to covalently couple an immobilized, genetically 
selected peptide to essentially only the target protein in a crude extract. If the peptide 
were labeled in some unique was, for example with biotin or a radiolabel, then the 
covalenlty trapped peptide-protein complex could be visualized easily or affinity purified. 

3. Photo-trapping 
A related strategy to improve affinity is to simply append the cross-linking reagent to the 
LEP rather than to employ it "in trans" as described above. This involves the chemical 
attachment of the appropriate metal-ligand complex to the peptide of interest. Particular 
examples would be derivatives of tris(2,2'-bipyridyl)Ru(II) dication (Fancy and 
Kodadek, 1999) or palladium(II) porphyrins (Kim et al., 1999). When activated 
photolytically or chemically, the appended cross-linkers would be expected to link the 
peptide to the target protein covalently. Experiments have demonstrated that molecules 
not closely associated are not cross-linked by these reagents, therefore covalent 
attachment of the peptide to the target protein is expected to be highly specific, reflecting 
the intrinsic binding properties of the peptide. 
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4. Mini-Pockets 

It is the belief of the inventor that if one wishes to generate very high affinity 
epitope-binding molecules, it will be necessary to elaborate the simple epitope-binding 
peptides so as to form mini-pockets that will allow the epitope-receptor interactions to be 
better shielded from water. The inventor is exploring two such avenues of research, one 
which relies on organic chemistry-based approaches, and the other molecular biology. 
Each will involve a strategy not unlike the molecular forcep approach used by the CSH 
group, but with the critical difference that the arms of the forcep will be different. One 
will be the epitope-binding peptide (EBP) selected in the genetic assay, while the other 
will be either another peptide or a synthetic, non-peptide oligomer. 

The chemical approach will involve the following steps. A library of oligomeric 
compounds (these could be peptides, peptoids or other non-peptidic species) will be 
synthesized by standard split/pool methods (Borman, 1997) on Tentagel beads. The 
library will then be capped with O-amino-a-hydroxy acetate to provide a site for specific 
attachment of an unprotected peptide segment. This will be accomplished by condensing 
the library with a peptide derivative produced by the oxidation of an N-terminal serine 
derivative of a genetically-selected peptide (Mikolajczyk et al., 1994; Rose et aL, 1996). 
Resultant oxime formation will provide a library of "pincer" molecules, all of which 
contain two arms, one which is the genetically-selected peptide, the other of which is a 
library of oligomers. This bead-bound library will then be screened against the labeled 
epitope, or a protein containing the epitope, under conditions (determined empirically), 
where the genetically selected peptide dos not bind the epitope strongly enough to 
provide a hit. In other words, the inventor will ask for the "library arm" to participate in 
binding so as to provide a higher affinity interaction. 

The chemistry described above for the construction of pincer libraries is favored, 
since the oxime connection between the library and the lead peptide is reversible (Rose et 
al., 1996), allowing recycling of the library. However, a number of other strategies could 
be conceived by an expert in the field of organic chemistry such as the inventor. In 
general, any chemistry that provides a reversible connection between the bead-bound 
library and the lead peptide is claimed. 
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The biological approach (see FIG. 7) will be similar in concept, except that the 
pincer library will be made and displayed on the surface of a bacteriophage. In this 
manifestation of the pincer concept, a phage displayed peptide library will be constructed 
using standard techniques with a specific protease site just N-terminal to it. This protease 
site will allow the exposure of an N-terminal serine after cleavage, which can, in turn, be 
oxidized selectively to an aldehyde. This provides a unique functional group on the 
surface of a phage to which an O-amino derivative of the genetically selected lead 
peptide can be attached via oxime formation (FIG. 7). Then, phage display will be 
employed to find phage that bind to an immobilized epitope target, again using conditions 
where the original peptide-epitope interaction is insufficient for stable association. 

The inventor is confident that both of these approaches will provide high affinity 
epitope-binding molecules. Since all of the compounds in the library will have at least a 
uM affinity for the epitope, all that is required is for the library-derived molecules to add 
another three orders of magnitude to the overall affinity. Put another way, the inventor is 
looking for compounds that can wrap around the original peptide-epitope complex with a 
K D of at least 10" 3 M, since coupled equilibria of 10" 6 M and 10" 3 M give an overall 
dissociation constant of 10" 9 M, typical of a good monoclonal antibody-epitope complex. 

Finally, it should be noted that both of the above schemes for the identification of 
pincer-type EBMs is novel and distinct from previous strategies by which molecules 
capable of two-point binding to a target have been obtained. Previously, all such 
molecules arose from the identification of two or more molecules capable of binding a 
target protein non-competitively (Hajduk et al., 1997; Maly et al., 2000; Shuker et al, 
1996). A linker arm was then designed or discovered that allowed the two ligands to 
bind cooperatively. This is an effective, but tedious, process and thus is poorly suited for 
high-throughput applications. The method of the inventor, where the second arm of the 
pincer and the linker are selected in a single combinatorial screen is a significant advance 
over these methods. 
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G. Secondary Screens 

1 . Peptide Binding Studies 

For screening the synthetic pincer libraries described above, relatively standard 
techniques will be employed. The epitope, or a protein containing it, will be labeled with 
an easily detectable molecule, such as fluorescein or biotin. This molecule will then be 
exposed to the library under demanding conditions (either low epitope concentration or 
high salt, etc.) so as to demand high affinity binding. Beads that bind the tagged protein, 
as visualized by fluorescence, for example, will be manually picked under a dissecting 
microscope. The identity of the variable oligomer will then be determined by direct 
sequencing using Edman degradation chemistry. 

For screening the phage-displayed pincer libraries, a standard panning protocol 
will be employed, as described by the inventor previously (Han and Kodadek, 2000). An 
immobilized peptide comprising the epitope or an immobilized protein containing the 
epitope of interest will be employed as the target. Again, harsh conditions (determined 
empirically) will be employed that will require a very high affinity binding event. The 
identity of the phage-displayed peptide encoded by the library in the high affinity binders 
will be determined by DNA sequencing of DNA isolated from the phage. 

2. Mutagenesis Procedures 

In certain embodiments, it may be desirable to start with a given ligand and then 
attempt to alter its binding properties (affinity, specificity, stability, etc.). Where 
employed, mutagenesis will be accomplished by a variety of standard, mutagenic 
procedures. Mutation is the process whereby changes occur in the quantity or structure 
of an organism. Mutation can involve modification of the nucleotide sequence of a single 
gene, blocks of genes or whole chromosome. Changes in single genes may be the 
consequence of point mutations which involve the removal, addition or substitution of a 
single nucleotide base within a DNA sequence, or they may be the consequence of 
changes involving the insertion or deletion of large numbers of nucleotides. 

Mutations can arise spontaneously as a result of events such as errors in the 
fidelity of DNA replication or the movement of transposable genetic elements 
(transposons) within the genome. They also are induced following exposure to chemical 
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or physical mutagens. Such mutation-inducing agents include ionizing radiations, 
ultraviolet light and a diverse array of chemical such as alkylating agents and polycyclic 
aromatic hydrocarbons all of which are capable of interacting either directly or indirectly 
(generally following some metabolic biotransformations) with nucleic acids. The DNA 
5 lesions induced by such environmental agents may lead to modifications of base 
sequence when the affected DNA is replicated or repaired and thus to a mutation. 
Mutation also can be site-directed through the use of particular targeting methods. 

i) Random Mutagenesis 

1 0 a. Chemical mutagenesis 

Chemical mutagenesis offers certain advantages, such as the ability to find a full 
range of mutant alleles with degrees of phenotypic severity, and is facile and inexpensive 
to perform. The majority of chemical carcinogens produce mutations in DNA. 
Benzo(a)pyrene, N-acetoxy-2-acetyl amino fluorene and aflotoxin Bl cause GC to TA 

15 transversions in bacteria and mammalian cells. Benzo(a)pyrene also can produce base 
substitutions such as AT to TA. N-nitroso compounds produce GC to AT transitions. 
Alkylation of the 04 position of thymine induced by exposure to n-nitrosoureas results in 
TA to CG transitions. 

A high correlation between mutagenicity and carcinogenity is the underlying 

20 assumption behind the Ames test (McCann et aL, 1975) which speedily assays for 
mutants in a bacterial system, together with an added rat liver homogenate, which 
contains the microsomal cytochrome P450, to provide the metabolic activation of the 
mutagens where needed. 

In vertebrates, several carcinogens have been found to produce mutation in the 

25 ras proto-oncogene. N-nitroso-N-methyl urea induces mammary, prostate and other 
carcinomas in rats with the majority of the tumors showing a G to A transition at the 
second position in codon 12 of the Ha-ras oncogene. Benzo(a)pyrene-induced skin 
tumors contain A to T transformation in the second codon of the Ha-ras gene. 
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b. Radiation Mutagenesis 
The integrity of biological molecules is degraded by the ionizing radiation. 
Adsorption of the incident energy leads to the formation of ions and free radicals, and 
breakage of some covalent bonds. Susceptibility to radiation damage appears quite 
5 variable between molecules, and between different crystalline forms of the same 
molecule. It depends on the total accumulated dose, and also on the dose rate (as once 
free radicals are present, the molecular damage they cause depends on their natural 
diffusion rate and thus upon real time). Damage is reduced and controlled by making the 
sample as cold as possible. 

10 Ionizing radiation causes DNA damage and cell killing, generally proportional to 

the dose rate. Ionizing radiation has been postulated to induce multiple biological effects 
by direct interaction with DNA, or through the formation of free radical species leading 
to DNA damage (Hall, 1988). These effects include gene mutations, malignant 
transformation, and cell killing. Although ionizing radiation has been demonstrated to 

15 induce expression of certain DNA repair genes in some prokaryotic and lower eukaryotic 
cells, little is known about the effects of ionizing radiation on the regulation of 
mammalian gene expression (Borek, 1985). Several studies have described changes in 
the pattern of protein synthesis observed after irradiation of mammalian cells. For 
example, ionizing radiation treatment of human malignant melanoma cells is associated 

20 with induction of several unidentified proteins (Boothman et ah, 1989), Synthesis of 
cyclin and co-regulated polypeptides is suppressed by ionizing radiation in rat REF52 
cells, but not in oncogene-transformed REF52 cell lines (Lambert and Borek, 1988). 
Other studies have demonstrated that certain growth factors or cytokines may be involved 
in x-ray-induced DNA damage. In this regard, platelet-derived growth factor is released 

25 from endothelial cells after irradiation (Witte, et at, 1989). 

In the present invention, the term "ionizing radiation" means radiation comprising 
particles or photons that have sufficient energy or can produce sufficient energy via 
nuclear interactions to produce ionization (gain or loss of electrons). An exemplary and 
preferred ionizing radiation is an x-radiation. The amount of ionizing radiation needed in 

30 a given cell generally depends upon the nature of that cell. Typically, an effective 
expression-inducing dose is less than a dose of ionizing radiation that causes cell damage 
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or death directly. Means for determining an effective amount of radiation are well known 
in the art. 

In a certain embodiments, an effective expression inducing amount is from about 
2 to about 30 Gray (Gy) administered at a rate of from about 0.5 to about 2 Gy/minute. 
5 Even more preferably, an effective expression inducing amount of ionizing radiation is 
from about 5 to about 15 Gy. In other embodiments, doses of 2-9 Gy are used in single 
doses. An effective dose of ionizing radiation may be from 10 to 100 Gy, with 15 to 75 
Gy being preferred, and 20 to 50 Gy being more preferred. 

Any suitable means for delivering radiation to a tissue may be employed in the 
10 present invention in addition to external means. For example, radiation may be delivered 
by first providing a radiolabeled antibody that immunoreacts with an antigen of the 
tumor, followed by delivering an effective amount of the radiolabeled antibody to the 
tumor. In addition, radioisotopes may be used to deliver ionizing radiation to a tissue or 
cell. 

15 

c. In Vitro Scanning Mutagenesis 
Random mutagenesis also may be introduced using error prone PCR (Cadwell 
and Joyce, 1992). The rate of mutagenesis may be increased by performing PCR in 
multiple tubes with dilutions of templates. 

20 One particularly useful mutagenesis technique is alanine scanning mutagenesis in 

which a number of residues are substituted individually with the amino acid alanine so 
that the effects of losing side-chain interactions can be determined, while minimizing the 
risk of large-scale perturbations in protein conformation (Cunningham et al, 1989). 

In recent years, techniques for estimating the equilibrium constant for ligand 

25 binding using minuscule amounts of protein have been developed (Blackburn et al 9 
1991; U.S. Patents 5,221,605 and 5,238,808). The ability to perform functional assays 
with small amounts of material can be exploited to develop highly efficient, in vitro 
methodologies for the saturation mutagenesis of antibodies. The inventors bypassed 
cloning steps by combining PCR mutagenesis with coupled in vitro 

30 transcription/translation for the high throughput generation of protein mutants. Here, the 
PCR products are used directly as the template for the in vitro transcription/translation of 
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the mutant single chain antibodies. Because of the high efficiency with which all 19 
amino acid substitutions can be generated and analyzed in this way, it is now possible to 
perform saturation mutagenesis on numerous residues of interest, a process that can be 
described as in vitro scanning saturation mutagenesis (Burks et aL> 1997). 
5 In vitro scanning saturation mutagenesis provides a rapid method for obtaining a 

large amount of structure-function information including: (i) identification of residues 
that modulate ligand binding specificity, (ii) a better understanding of ligand binding 
based on the identification of those amino acids that retain activity and those that abolish 
activity at a given location, (iii) an evaluation of the overall plasticity of an active site or 
10 protein subdomain, (iv) identification of amino acid substitutions that result in increased 
binding. 

d. Random Mutagenesis by Fragmentation and Reassmbly 
A method for generating libraries of displayed polypeptides is described in U.S. 
15 Patent 5,380,721. The method comprises obtaining polynucleotide library members, 
pooling and fragmenting the polynucleotides, and reforming fragments therefrom, 
performing PCR amplification, thereby homologously recombining the fragments to form 
a shuffled pool of recombined polynucleotides. 

20 ii) Site-Directed Mutagenesis 

Structure-guided site-specific mutagenesis represents a powerful tool for the 
dissection and engineering of protein-ligand interactions (Wells, 1996, Braisted et aL 9 
1996). The technique provides for the preparation and testing of sequence variants by 
introducing one or more nucleotide sequence changes into a selected DNA. 

25 Site-specific mutagenesis uses specific oligonucleotide sequences which encode 

the DNA sequence of the desired mutation, as well as a sufficient number of adjacent, 
unmodified nucleotides. In this way, a primer sequence is provided with sufficient size 
and complexity to form a stable duplex on both sides of the deletion junction being 
traversed. A primer of about 17 to 25 nucleotides in length is preferred, with about 5 to 

30 10 residues on both sides of the junction of the sequence being altered. 
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The technique typically employs a bacteriophage vector that exists in both a 
single-stranded and double-stranded form. Vectors useful in site-directed mutagenesis 
include vectors such as the Ml 3 phage. These phage vectors are commercially available 
and their use is generally well known to those skilled in the art. Double-stranded 
5 plasmids are also routinely employed in site-directed mutagenesis, which eliminates the 
step of transferring the gene of interest from a phage to a plasmid. 

In general, one first obtains a single-stranded vector, or melts two strands of a 
double-stranded vector, which includes within its sequence a DNA sequence encoding 
the desired protein or genetic element. An oligonucleotide primer bearing the desired 

10 mutated sequence, synthetically prepared, is then annealed with the single-stranded DNA 
preparation, taking into account the degree of mismatch when selecting hybridization 
conditions. The hybridized product is subjected to DNA polymerizing enzymes such as 
E. coli polymerase I (Klenow fragment) in order to complete the synthesis of the 
mutation-bearing strand. Thus, a heteroduplex is formed, wherein one strand encodes the 

15 original non-mutated sequence, and the second strand bears the desired mutation. This 
heteroduplex vector is then used to transform appropriate host cells, such as E. coli cells, 
and clones are selected that include recombinant vectors bearing the mutated sequence 
arrangement. 

Comprehensive information on the functional significance and information 
20 content of a given residue of protein can best be obtained by saturation mutagenesis in 
which all 19 amino acid substitutions are examined. The shortcoming of this approach is 
that the logistics of multi-residue saturation mutagenesis are daunting (Warren et aL, 
1996, Brown et al, 1996; Zeng et al 9 1996; Burton & Barbas, 1994; Yelton et al. 9 1995; 
Jackson et al, 1995; Short et al., 1995; Wong et al 9 1996; Hilton et al 9 1996). 
25 Hundreds, and possibly even thousands, of site specific mutants must be studied. 
However, improved techniques make production and rapid screening of mutants much 
more straightforward. See also, U.S. Patents 5,798,208 and 5,830,650, for a description 
of "walk-through" mutagenesis. 

Other methods of site-directed mutagenesis are disclosed in U.S. Patents 
30 5,220,007; 5,284,760; 5,354,670; 5,366,878; 5,389,514; 5,635,377; and 5,789,166. 
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3. Chip Technology 

Specifically contemplated by the inventor is the use of high affinity EBMs as 
capture agents in chip-based technologies. These techniques involve quantitative 
methods for analyzing large numbers of molecules rapidly and accurately. U.S. Patent 
5 5,143,854 describes large scale photolithographic solid phase synthesis of polypeptides 
and receptor binding screening thereof. U.S. Patents 5,807,522 and 6,110,426 both 
provides methods for the fabrication of microarrays for biological samples. These 
technologies as well as related ones (Affymax) provide for the screening or large 
numbers of peptides or nucleic acids using automated and high throughput methods. 
10 Since the method discussed in this application will lead to the rapid discovery of large 
numbers of chemically synthesizable chemical ligands (as opposed to antibodies or other 
proteins that must be produced biologically), it is expected to fill an important need in the 
development of a commercial protein chip industry. 

15 H. Substrate Directed Inhibition 

Enzymes that covalently modify other proteins, such as proteases, kinases and 
phosphorylases, are very important pharmaceutical targets, since these modifications are 
often at the heart of signal transduction pathways that control many cellular processes. 
For example, the clinically used immunosuppressants cyclosporin A and FK-506 inhibit 

20 the phosphatase calcineurin in T-cells, blocking the dephosphorylation of transcription 
factor NF-AT, thereby inhibiting transcription of genes important in generating an 
immune response. Another example is the new class of anti-HIV drugs which bind to the 
HIV protease and prevent it from converting inactive viral polyproteins to their active, 
mature products. In fact, almost all known protease, kinase and phosphatase inhibitors 

25 target the enzyme itself, either directly and indirectly. 

The inventor has developed a concept called substrate-directed inhibition (FIG. 
8). The rationale is to discover compounds that bind to the segment of the substrate 
protein which is cut by the protease or phosphorylated or dephosphorylated by the kinase 
or phosphatase, respectively. Substrate-targeted inhibitors will have two very important 

30 advantages over traditional enzyme-targeted inhibitors. First, for enzymes that have a 
number of different substrates, one can block processing of a single substrate, but not 

1663250.1 

-48- 



others. Traditional inhibitors necessarily block all reactions catalyzed by the target 
enzyme. Since substrate-targeted inhibitors will be more specific, they may have fewer 
side effects if some activities of the enzyme are physiologically beneficial. Second, 
enzyme-targeted inhibitors often lose their potency over time due to mutations which 
5 allow the enzyme to escape binding to the inhibitor, but which have no effect on catalytic 
activity. A mutation which allowed the substrate to evade binding an inhibitor targeted to 
it would very likely also abolish enzyme-substrate recognition. Therefore, substrate- 
based inhibitors will probably have longer useful lifetimes, particularly in anti-bacterial 
and anti-viral applications, where genomic mutation can occur rapidly. 

10 A general system to discover substrate-targeted inhibitors using the X Repressor- 

based system is described. The peptide sequence recognized by the enzyme (i.e., the 
protease cleavage site, kinase phosphorylation site, etc.) is fused to the Repressor DNA- 
binding domain (DBD) and a peptide library fused to another copy of the Repressor 
DNA-binding domain is scanned for library-encoded peptides that bind the target peptide 

15 with high affinity and specificity (bottom of FIG.. 1). The library-encoded peptides are 
then synthesized and evaluated as substrate-targeted inhibitors. These peptides may serve 
as lead compounds for the development of non-peptidic small molecule analogues, or 
may be used directly as drugs if tethered to a molecule that can deliver them into cells. 

20 I. Examples 

The following example is included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skill in the art that the techniques 
disclosed in the example which follows represent techniques discovered by the inventor 
to function well in the practice of the invention, and thus can be considered to constitute 

25 preferred modes for its practice. However, those of skill in the art should, in light of the 
present disclosure, appreciate that many changes can be made in the specific 
embodiments which are disclosed and still obtain a like or similar result without 
departing from the spirit and scope of the invention. 

The pro-IL-ip-derived epitope (ICS) was chosen for the reason that it includes 

30 one of the two sites cleaved by interleukin converting enzyme (ICE) during the hormone 
maturation process (Dinarello, 1998). If these cleavages do not occur, the hormone 
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remains inactive. Since the inception of this project and well before Nestler's report of 
the inhibition of Ras farnesylation (Dong et al., 1999), we had hoped that epitope-binding 
molecules could be used as "protecting groups" for sites of post-translational 
modification in proteins such as proteolysis, phosphorylation, glycosylation, etc. What 
5 would be the advantage of such an approach over the use of traditional enzyme-targeted 
inhibitors? The vast majority of proteases, kinases, etc. have multiple cellular substrates. 
Of course, traditional enzyme-targeted inhibitors block the processing of all substrates 
acted upon by the enzyme. However, a substrate-targeted inhibitor holds out the 
possibility that processing of a single substrate could be blocked without affecting other 

10 reactions mediated by the same enzyme. Such a strategy might be advantageous if 
blocking one event mediated by a given enzyme had some therapeutic benefit, but 
blocking another resulted in toxicity. As a tool for basic research, such inhibitors could 
prove valuable in assessing the physiological consequences of various different post- 
translational modifications mediated by an enzyme with multiple substrates. Another 

15 scenario in which a substrate-targeted inhibitor might be useful is when the enzyme that 
mediates the post-translational modification event is unknown, making it difficult to 
develop classical inhibitors. 

Substrate-targeted inhibition could be achieved if the epitope-binding molecule 
recognized not only the residues required for the post-translational modification event, 

20 but also flanking residues unique to a given substrate. To test the feasibility of this idea, 
we constructed two different model substrates for ICE. Each was a fusion protein in 
which an N-terminal GST protein was linked to a C-terminal protein (GFP in "GG" and 
MBP in "GM", see FIG. 8). The linker in each case included a sequence known to be 
cleaved by ICE. In GG, the linker is the IL-lp-derived epitope against which LEPB was 

25 isolated. In GM, the proteins are linked by a sequence found in a different ICE substrate 
that shares little sequence homology with ICS other than the Asp in the cleavage site (see 
FIG. 8). Actually, this is a site in immature human ICE (activation is autocatalytic). 
Purified ICE cuts both the GG and GM substrates efficiently under these conditions. 
When synthetic LEPB was titrated into the reaction however, a progressive inhibition of 

30 the cleavage of GG, which contained the ICS epitope was observed (FIG. 9). The IC 50 
was somewhere between 50-100 \iM. However, LEPB addition had no detectable affect 
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on GM cleavage. A control peptide taken at random from the library had no effect on 
either reaction (data not shown). It is noteworthy that the IC50 measured in this 
experiment is well above the measured 2 ^iM K D of the LEPB-ICS complex. This 
probably reflects the fact that the peptide must compete with protease for the GG 
5 substrate. Alternatively, the K D was measured in a different context and while context is 
not qualitatively critical for binding, it probably modulates the association constant. In 
particular, this is the first experiment in which we have employed a peptide with a free N- 
terminus. It is conceivable the lack of any N-terminal extension could reduce binding. 

This exciting result clearly demonstrates the feasibility of substrate-targeted 
10 inhibition of post-translational modification. This is, to the best of the inventor's 
knowledge, the first demonstration of a substrate-targeted protease inhibitor. 

5jc sjc «$c *$c jflc 5J1 jjc sjs sjt sfc jjc *£c #£c s]c sfc s|c sjc *Jc #|* *jc 

1 5 All of the COMPOSITIONS and METHODS disclosed and claimed herein can be 

made and executed without undue experimentation in light of the present disclosure. 
While the compositions and methods of this invention have been described in terms of 
preferred embodiments, it will be apparent to those of skill in the art that variations may 
be applied to the COMPOSITIONS and METHODS and in the steps or in the sequence 

20 of steps of the method described herein without departing from the concept, spirit and 
scope of the invention. More specifically, it will be apparent that certain agents which 
are both chemically and physiologically related may be substituted for the agents 
described herein while the same or similar results would be achieved. All such similar 
substitutes and modifications apparent to those skilled in the art are deemed to be within 

25 the spirit, scope and concept of the invention as defined by the appended claims. 
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